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(54) Scene change detector for digital video 

(57) In a method for detecting a scene change 
between a prior video picture and a current video pic- 
ture of a sequence of pictures, an average luminance 
value is determined for a block pair of the prior and cur- 
rent video pictures. Preferably, the blocks of the block 
pair are located, respectively, in the same relative posi- 
tion in the prior and current pictures. An incremental vis- 
ual sensation value is determined using a difference 
between the average luminance values. If the incremen- 
tal visual sensation value exceeds a block contrast 
threshold level, a scene change is indicated. In particu- 
lar, if the minimum of the average luminance values of 



the current and prior picture blocks exceeds a dark 
scene threshold, the incremental visual sensation value 
is determined using the ratio of (a) the absolute value of 
the difference between the average luminance values, 
and (b) the minimum of the average luminance values of 
the current and prior picture blocks. Otherwise, the 
incremental visual sensation value is determined using 
the ratio of (a) the absolute value of the difference, and 
(b) the dark scene threshold. The method may be opti- 
mized by adjusting the block size based on the relative 
amount of motion and the current picture type. 
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Description 

BACKGROUND OF THE INVENTION 

5 vkl JvZ« 7?^™ ?'?' 63 ^ a ? appaate 3nd ** detectin9 scene <*am>es in a sequence of digital 

^ZrZi 9 , P ^ H PartCUter ' Cha " 9es in ,uminance in di « erent blocks of a video frame relative to corre- 
sponding blocks in a previous frame are used to provide a reliable indication of a scene change 

Recently, digital video transmission formats have become increasingly popular for providing television and other 
a^v^andyordatasem 

offerings. d.grtal video can provide a higher image quality than traditional analog television broadcast 

In order to transmit digital video signals within available bandwidths. it is necessary to use data compression tech- 
no, """l ^ mpreSSi ° n ,6ChniqueS teke advanta9e of batons between neiSnnTS^or 
"^ofpocels.™^ 

SoT CeSS ' Ve M ° reWer ' m ° ti0n C ° mPenSat0n •-**«- «" ««"*• grater ,empo^clp:es 

k . J!!^ " " d * *° Pr0Vide ° ptimal com P ression ° f a s «1"ence of video frames, it is desirable to have the caoa- 
bihty to detect scene changes in the sequence. A scene change can be defined genera.ly as any signHicantchan^ 

20 era anglers changed or when there ,s a switch between a close-up view and a panoramic view Moreover often toi 
ascenechange.srnd.catedbyanoticeab.e change in the luminance level between successive video ^^0^ 
pie, a scene change may occur when a bright light is switched on in a dark room 

P «^ 6 ^ Cene Cha09e T b6en d6,eCted ' 6nC0ding 0f ** vkteo se ^ uence ^ be modified accordingly. For 
example, motron compensate may be temporarily suspended when a scene change is detected since thesis a 

sceneMoreover. a specrfrc type of p 1C ture (eg.. I. P. or B picture) may be selected based on scene change information 
I. P and B pictures are def.ned by the MPEG-2 standard as discussed in greater detail below 'r«°rmat.on. 
Vanous existing scene detection systems attempt to provide reliable scene change detection For examole one 

1£Z ST- r 01 2 abS0 ' Ute 0< the mere ™ °* responding pixe. vaLs be£een Z Sto 
and the previous frame, and compares this sum with a predetermined constant threshold to determine whether 

ISSfT SySt6m ^ fail 10 9iVe re ' iab,e reSU,ts " a fast •**» «™ h mgJSSm. fI* 

faHX) ' S COnS ' S,en,ly reliab ' e differem leV6,S ° f m0fi0n Presen1 (e 9 ' ^eratefy fi 

Another system determines the absolute value of the difference between conesponding pixel values between the 
current frame and the previous frame. Then, the absolute value of the difference^ ZS^Z 2S Z£s 

TIT^T^™ t B 06X1 ^ 15 determined ,he sum * *• dWerenceX aZe two d«^ 
S^J * T^* 3 P redetermined ^reshold to determine whether there is a scene 

Accordmgly. rt would be desirable to provide a scene detection system for digital video which can reliaWydetect 

S£ SS? £S fV^EfS W f! xisfin9 di9tel video encodin 9 standards including the Motion Picture 
2?£l « { deta " S 01 Wh ' Ch c 3 " °e found in document ISO/IEC JTC1/SC29/WG1 1 N0702 enti- 

SLESFS? TeChn0l£ ^ - Q9neric ^"9 «* M ™"9 Pictures and Associated Audio. Recommendation H.262." 
March 25, 1994, incorporated herein by reference. 

cr^nSp^T i ? ( *! d fK tt !. ree tyP6S 01 Vide ° Pictures; a »"«* a *y. the intra-coded picture (l-picture), predictive- 
S ( * 6) ' b,< " rectonal, y Predictive-coded picture (B-picture). Furthermore. either frame or field 

^ P T^!r^ U r CeS ^ eaCC ° mm0da,ed '■»**" ""P"* 8 * describes a «ideo picture without refer- 
ence to any other picture. For improved error concealment motion vectors can be included with an l-picture An error 
man picture has the potential for greater impact on the displayed video since both P-pictures and B-pictures in the 
base layer are predicted from 1-p.ctures. P pictures are predicted based on previous . orP pictures. Tne referenced 

£TJ£r. or p *T t0 J Tf p - picture and " ^ 35 fonward « 

closest earlier I or P picture and the closest later I or P picture 

nira ^ a ^ a9eOUS sc fe detection system would also provide automatic control of the rate control and adaptive 
wr Z^T 8 °i Wde0 COmpreSSi0n encoders "» various standards, including MPEG-1. MPEG-2 
ISO/IEC K261 (vdexonferencng). and ISO/IEC H.263. Moreover, the system should also be compatible with vark^ 
cotor te.evis.on broadcast ^standards such as me National Televiston Standards Committee (NTScfstariard ^ 

bST fr ^ Si r M " na&]9 Une (PAL) Standard " used in Eur °P* ^ Should further be cor^aiibS wim 
both frame and feld mode vxJeo. The present invention provides a system having the above and olher aZitages 
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SUMMARY OF THE INVENTION 

In accordance with the present invention, a method and apparatus are presented for detecting a scene change 
between a prior picture and a current picture in a sequence of video pictures. 

5 In a method for detecting a scene change between a prior video picture and a current video picture, an average 
luminance value is determined for a block pair of the prior and current video pictures. Preferably, the blocks of the block 
pair are located, respectively, in the same relative position in the prior and current pictures. Next, an incremental visual 
sensation value is determined using a difference between the average luminance values. If the incremental visual sen- 
sation value exceeds a block contrast threshold level, a scene change is indicated. The block contrast threshold level 

10 may be approximately fifteen to twenty-five times a Weber fraction constant defined herein. 

In particular, a minimum of the average luminance values of the current and prior picture blocks is determined, 
where, if the minimum exceeds a dark scene threshold, the incremental visual sensation value is determined using the 
ratio of (a) the absolute value of the difference between the average luminance values, and (b) the minimum of the aver- 
age luminance values of the current and prior picture blocks. Otherwise, the incremental visual sensation value is deter- 

15 mined using the ratio of (a) the absolute value of the difference, and (b) the dark scene threshold. The dark scene 
threshold may be approximately 10% of a maximum gray level. 

Additionally, the difference between the average luminance values may be determined for a plurality of block pairs 
of the prior and current video pictures. Preferably, every block pair in the pictures are used to provide an overall picture 
scene change determination. The incremental visual sensation value is determined for each of the block pairs using the 

20 differences, where, if the incremental visual sensation value exceeds the block contrast threshold level for a threshold 
proportion of block pairs in the current and prior video pictures, a scene change is indicated. This threshold proportion 
may be approximately 80% to 90%. 

Furthermore, the method may be adaptively optimized by determining a relative amount of motion between the 
blocks of the block pair, and adjusting a size of the blocks based on the relative amount of motion. In particular, the size 

25 of the blocks is increased as the relative amount of motion increases. Moreover, the relative amount of motion can be 
found by determining a sum of the absolute value of a horizontal motion vector and the absolute value of a vertical 
motion vector, where the horizontal and vertical motion vectors are indicative of horizontal and vertical motion, respec- 
tively, of a video image of the current picture block relative to a video image of the prior picture block. A determination 
is then made to see if the sum exceeds a motion threshold. The motion threshold may be adjusted according to a pic- 

30 ture type of the current picture (e.g., whether the current picture is an I, P or B picture). 
A corresponding apparatus is also presented. 

BRIEF DESCRIPTION OF THE DRAWINGS 

35 FIGURE 1 illustrates a comparison between blocks of two consecutive video frames in accordance with the present 
invention. 

FIGURE 2 is a block diagram of a scene change detector in accordance with the present invention. 
DETAILED DESCRIPTION OF THE INVENTION 

40 

A method and apparatus are presented for detecting scene changes in a sequence of digital video frames. 

The brightness level of a scene is determined by the average luminance of the pixels which comprise the scene, 
and the dynamic range of the luminance values of the pixels. Moreover, the visual sensation of brightness to the human 
eye is generally considered to be a function of the natural logarithm of image luminance. At a frame and/or field of a 
45 scene change, the visual sensation of brightness is changed significantly from the previous frame or previous corre- 
sponding field. 

Furthermore, since human perception is more sensitive to a luminance contrast rather than the absolute luminance 
values themselves, the incremental visual sensation 6C between two scenes is a good indicator of a scene change. SC 
is defined as the differential value of the average brightness of a region (e.g., block) which has the same relative position 
so in the two frames and/or fields. 

In particular, according to Weber's law, if the luminance b 0 of an object is just noticeably different from the lumi- 
nance b $ of a surrounding region, then the following ratio known as the Weber fraction can be defined: 

. = C = constant. 



The Weber fraction remains approximately constant at high background luminance levels, e.g., greater than 0.5 mL 
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(milliLumens). The value of the constant C has been found to be 0.02, which means that on an scale of 0 to 1 , at least 
fifty different luminance levels are required for the contrast between levels to be perceptible by a human. 

Denoting b 0 = o„ one can write b s = 6 + 66, where 6b is the smallest perceptible luminance change 
Then, 



^ * d ( ,0 9 G b )= sc (constant) 



io which indicates that 

15 

is proportional to the incremental visual sensation of brightness. 

FIGURE 1 illustrates a comparison between blocks of two consecutive video frames in accordance with the present 
invention. A current frame, Frame (i), shown at 100, includes a block 1 10. A previous frame. Frame (i-1), shown at 150 
includes a block 1 60 which is located in the same relative position in the frame 1 50 as block 11 0 is located in frame 1 00 
For instance, with an NTSC format, the frames 100 and 150 may each comprise thirty slices, with each slice having 
forty-tour macroblocks. Thus, an entire NTSC frame comprises 1.320 macroblocks. Moreover, a macroblock typically 
comprises a 16 x 16 block of pixels which, in the MPEG-2 standard, for example, is comprised of four 8 x 8 pixel blocks 
Thus, an NTSC frame may comprise 44 x 16 = 704 pixels in width, and 30 x 16 = 480 pixels in height, for a total of 
337.920 pixels. Furthermore, the present invention is compatible with the PAL format, which includes 1 584 macrob- 
locks in 36 slices, with 44 macroblocks per slice, and 16 x 16 pixels per macroblock. 

Blocks 1 10 and 160 are designated by the coordinate set (k.l). where k is the horizontal index of the block and I is 
the vertical index. Furthermore, each of the blocks 110 and 160 may have a size, for example, of 16 pixels in height by 
32 pixels in width. In this case, k will range from 1 to 704/32=22, and I will range from 1 to 480/16=30 The followina 
terms are defined: * 



20 



25 



30 



h height of frame (pixels) 

w width of frame (pixels) 

m height of block (pixels) 

n width of block (pixels) 

35 i frame index 

k horizontal block index (k=l h/m) 

I vertical block index (1=1 , .... w/n) 

Xj W pixel intensity of ith frame, kth horizontal block, Ith vertical block 



40 



55 



Thus, we have two consecutive frames and/or two top (or bottom) fields which are defined by a set of pixels In par- 
ticular, the (i)th frame, frame 100. is defined by a set of pixels X- w> and the (i-l)th frame, frame 150, is defined by a set 
of pixels X M k ,. In order to effectively distinguish a scene change, each frame is partitioned into a set of k x 1 disjoint 
Wooks, with each block having m x n pixels. 

Note that the size of the block can be programmed to adaptively change based on the current motion information. 
In particular, the faster the motion is, the larger the block size m x n should be. One way to adjust the block size for each 
frame based on the amount of motion is by performing pre-processing as follows. First an index v[x][y] is computed for 

f 3 ^, 1 ,?? 16 maCr ° b,OCK whew X=1 ' 2 ' - * w/16 l' y= 1 - 2 » the full pixel forward motion vector, vec- 

tor[x][y][z], satisfies the following inequality: 

|vecton:x][y][0]|4|vectortxI[y][1]|>T3, 

then a fast motion between the two blocks is indicated. Vector[x][y][0] and vectortx][y][1] are the horizontal and vertical 
motion vectors, respectively, of a current frame block (e.g.. block (x.y)) relative to a prior frame block Thus if the ine- 
quality is met, set the index v[x][y]=l , otherwise, set vfx][y]=0 . 

Note that the motion vectors vector{x][y][z] are obtained from the closest available picture with the same picture 
type. For example, if the current picture type is a P-picture. then motion vectors vectorfx][y][zJ are motion vectors of the 
previous predicted P-p«cture. This is true since the scene change detection for each picture occurs before the motion 
estimation of the picture. 
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The threshold T 3 is selected based on the different picture types which are present in the sequence of video 
frames. For example, rf there are no B-pictures in the bitstream, e.g., with the sequence I, P, P. .... then T 3 =16 is an 

appropriate choice. If there is one B-picture present, e.g., with the sequence P, B, P, B then T 3 =16 is an appropriate 

choice if the current picture is a B-picture, and T 3 =32 is an appropriate choice rf the current picture is a P-picture. and 
5 so forth. 

Next, the block size is adjusted accordingly. An initial (default) block size of 16 x 16 may be used. Then, the block 
size may be adjusted based on vfx][y]. For example, if v[x][y]=1 , then the block size may be increased, e.g.. to 16 x 32 
or 32 x 32. Similarly, if v[x][y]=0 , then the block size may be decreased, e.g., to 8 x 16. However, note that the block 
size should not be increased such that the block crosses over the right and/or bottom boundary of a macroblock. Gen- 
ie erally, the block size should be larger when the motion is faster. Moreover, the largest allowed block size may be limited 
in some applications to 32 x 48 pixels. 

Next, the average luminance of each block in a frame (or top field) is determined. For the (i)th frame, block (k,l), the 
average luminance is: 

15 m-1 n-1 

B i,k,l = jj^jZ Z X i.kh+c1,lw+c2' 
c1=Oc2=0 



20 for k=1, ... h/m, and 1=1 w/n. d and c2 are dummy counting indexes. Next, the block-luminance-increment $B i k! 

between the (i)th and (i-1) frames (or top fields) is determined by: 

25 Furthermore, the relative block-incremerrtal-contrast 5C W for the (i)th frame, block (k,l). can be defined by: 



30 



\SBi 



\SB ikl \ 
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T 0 is a threshold which indicates that a scene is considered to be a dark scene. Generally, T 0 =25.5 may be used, which 
is 10% of the maximum gray scale level 255. 

For a scene change, a significant threshold T 1 of the relative change of block luminance contrast is set as: 
7 1 =15~25C. Now, consider an index array. index[k]p] t for k=1 h/m, 1=1, .... w/n. defined by: 



fl>if5C iJc , >Tj. 



Then, if approximately 80-90% of the blocks in a frame have a relative block-incremental-cbntrast which is greater than 
the significant threshold, i.e., 

I h/ mil w/n} 
£ Jjndex{k][l] 

*?* , /=1 , ; — : — j — > 7*2 > where T 2 = *Q%~ 90%, 



then, in accordance with the present invention, a scene change is indicated. The range of 80-90% was determined 
based on extensive testing, but the actual optimal figure may vary with the particular scene. Note that the mathematical 
expression x J denotes rounding of the non-integer x to the next lowest integer. 
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FIGURE 2 is a block diagram of a scene change detector in accordance with the present invention. The detector, 
shown generally at 200. includes input terminals 205 and 210. At input terminal 205. pixel data from the current frame! 
Xj k Is received and provided to a block average function 215 to produce the average luminance value for each block 
in the ith frame. B i k Similarly, at input terminal 210. pixel data from the previous frame. X i . 1 k , is received and provided 
to a block average function 220 to produce the average luminance value for each block in the (M)th frame. B M k 

Minimizer unit (MIN) 225 determines min{Bj kl , B Mk ,) and outputs this term to a divisor function 230. Meanwhile, 
subtracter 235 determines 5 B iXi = B ikr B^ kh Absolute value function 240 determines |8 B s k /|, and provides this term 
to the divisor 230. The divisor 230 determines the relative block-incremental-contrast 60,^, for the (i)th frame, block 
(k,l), depending on whether minfS^,, B M A J>T 0 . bC ikJ is then provided to a threshold function 235 which determines 
whether block (k,l) is indicative of a scene change (e.g.. whether SC ikJ >T v If so. an index[k][l] may be set accordingly. 
Accumulator 240 accumulates the scene change result for each block, and sums the result over the entire frame or a 
portion thereof. Finally, threshold function 250 receives the summed result from accumulator 240. and uses the thresh- 
old T 2 to determine whether a scene change for the overall frame is indicated. 

The scene change detection system of the present invention was tested extensively using different video 
sequences. In particular, the "Football". "Mobile Calendar". "Flower Garden", and "Table Tennis" video sequences 
described in Test Model Editing Committee. "Test Model 5". ISO/IEC JTC1/SC29/WG1 1 MPEG93/457. April 1 993. were 
analyzed, along with the "Destruct". "Street Organ". "Silent", and "Fun Fair" video sequences, described in the Ad hoc 
group on MPEG-4 video VM editing, "MPEG-4 Video Verification Model Version 3.0", ISO/IEC JTC1/SC29/WG11 
N1277, Tampere. Finland. July 1996. 

Sample test results of the scene detection system of the present invention are shown in Table 1 , below. The thresh- 
olds were selected as T 1= 0.3 and T 2 =0.85, and the block size was m=16, n=32. The particular video sequence is iden- 
tified in the first column. The sequence of frames involved is indicated in the second column. For example, [0:50] 
indicates that frames 0 through 50 were analyzed for a scene change. The third column indicates whether a scene 
change was detected, and if so, in which frames. For example, a scene change in a third frame means that a scene 
change between the second and third frames was detected. The fourth column, if applicable, provides additional infor- 
mation on the nature of the video sequence. 



Table 1 



Sequences 


Frames 


Scene Change 


Comments 


Football 


[0:50] 


No 


Fast motion 


Mobile Calendar 


[0:44] 


No 




Street Organ 


[0:50] 


No 




Silent 


[0:50] 


No 


No motion to motion 


Flower Garden 


[0:30] 


No 


Camera panning 


Fun Fair 


[0:50] 


No 


Fast motion 


Table Tennis 


[90:100] 


97th 




Destruct 


[0:40] 


25th.26th 


A bright light 


Combination of any two sequences 




Yes 


scene change detected every time 



Moreover, for frames in the above video sequences in which a scene change was detected, coding efficiency was 
examined using the MPEG-2 WG-1 1 programs. Coding efficiency is measured by determining the number of bits gen- 
erated by the coding method to achieve a given image quality. Specifically, for a constant quality level, fewer coding bits 
are indicative of higher coding efficiency. It was determined that, if a frame with a scene change detected in accordance 
with the present invention is coded as a P-picture type, then more than 46% of the macroblocks are coded as l-pictures. 
Thus, the scene detection system of the present invention was found to operate as expected since it successfully 
located frames which are difficult to efficiently code using predictive coding. Advantageously, such frames can be coded 
as l-pictures since the rate control engine of the encoder allocates more bits for l-pictures. thereby also improving image 
quality. 

Generally, in a typical encoder, such as an MPEG-2 encoder using the Test Model 5 standard, there is a process 
for selecting a macroblock coding type for each macroblock of a P-picture or B-picture. Such a process will determine 
whether the macroblock should be coded as an intra<oded block (l-coded) or non-irrtra-coded block (P- or B-coded) 
based on which coding type provides better quality or uses fewer data bits. For a P-picture or B-picture. if the prediction 
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is efficient, only a small proportion of macroblocks in a picture will be 1-coded (e.g., less than five per cent). This is desir- 
able as l-coded blocks consume a relatively large number of data bits since there is no temporal compression. 

If the proportion of l-coded macroblocks in a picture is greater than, e.g., thirty or forty per cent, then the picture 
quality will be poor. In this case, prediction coding is inefficient for the picture, as would be expected at a scene change. 
Thus, when a scene change occurs, it is generally desirable that the first frame of the new scene should not be coded 
as a P-picture. 

Although the invention has been described in connection with various specific embodiments, those skilled in the art 
will appreciate that numerous adaptations and modifications may be made thereto without departing from the spirit and 
scope of the invention as set forth in the claims. For example, the various threshold levels set forth herein may be 
adjusted according to the particular scene or video sequence which is analyzed. That is, some types of video 
sequences, such as action movies, may be characterized by more frequent and pronounced scene change activity. 
Moreover, specific lighting conditions may be associated with a particular video sequence, e.g., such as a horror film, 
where lighting levels may be relatively low throughout the sequence. In this case, the scene change detection thresh- 
olds can be adjusted accordingly. 

Moreover, it may be desirable to analyze only a portion of a video picture to determine a scene change, or different 
portions may be analyzed using different thresholds. For instance, in a video sequence of a landscape scene with a rel- 
atively dark earth at the bottom part of the picture and a relatively bright sky at the top part of the picture, a more sen- 
sitive scene change threshold may be used for the bottom part of the picture. Similarly, different sized blocks may be 
used in different regions of a picture. For instance, when motion is more prevalent toward the middle of a picture than 
toward the edges, larger block sizes may be used in the middle of the picture. 

Claims 

1 . A method for detecting a scene change between a prior video picture and a current video picture, comprising the 
steps of: 

determining average luminance values of a block pair of said prior and current video pictures; and 
determining an incremental visual sensation value using a difference between said average luminance values; 
wherein: 

if said incremental visual sensation value exceeds a block contrast threshold level, a scene change is indi- 
cated. 

2. The method of claim 1 , wherein said block contrast threshold level is approximately fifteen to approximately twenty- 
five times a Weber fraction constant. 

3. The method of claim 1 or 2, wherein said blocks of said block pair are located, respectively, in the same relative 
position in said prior and current pictures. 

4. The method of one of the preceding claims, comprising the further step of: 

determining a minimum of said average luminance values of said current and prior picture blocks, wherein: 

if said minimum exceeds a dark scene threshold, said incremental visual sensation value is determined 
using the ratio of (a) the absolute value of said difference, and (b) said minimum; 
else, said incremental visual sensation value is determined using the ratio of (a) the absolute value of said 
difference, and (b) said dark scene threshold. 

5. The method of claim 4, wherein: 

said dark scene threshold is approximately 10% of a maximum gray level. 

6. The method of one of the preceding claims, wherein: 

said difference between average luminance values is determined for a plurality of block pairs of said prior and 
current video pictures; and 

said incremental visual sensation value is determined for each of said block pairs using said differences; 
wherein: 
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if said incremental visual sensation value exceeds the block contrast threshold level for a threshold propor- 
tion of block pairs in said current and prior video pictures, a scene change is indicated. 

7. The method of claim 6, wherein said threshold proportion is approximately 80% to approximately 90%. 

8. The method of one of the preceding claims, comprising the further step of: 

determining a relative amount of motion between said blocks of said block pair; and 
adjusting a size of said blocks based on said relative amount of motion. 

9. The method of claim 8, wherein the size of said blocks is increased as said relative amount of motion increases. 

1 0. The method of claim 8, wherein said step of determining a relative amount of motion comprises the further steps of: 

determining a sum of the absolute value of a horizontal motion vector and the absolute value of a vertical 
motion vector; 

wherein said horizontal and vertical motion vectors are indicative of horizontal and vertical motion, 
respectively, of a video image of said current picture block relative to a video image of said prior picture block' 
and 

determining if said sum exceeds a motion threshold. 

11. The method of claim 10, wherein: 

said motion threshold is adjusted according to a picture type of said current picture. 

12. An apparatus for detecting a scene change between a prior video picture and a current video picture, comprising; 

means for determining average luminance values of a block pair of said prior and current video pictures; and 
means for determining an incremental visual sensation value using a difference between said average lumi- 
nance values; wherein: 

if said incremental visual sensation value exceeds a block contrast threshold level, a scene chanqe is indi- 
cated. 

13. The apparatus of claim 12, wherein said block contrast threshold level is approximately fifteen to approximately 
twenty-five times a Weber fraction constant. 

14. The apparatus of claim 12 or 13, further comprising: 

means fa determining a minimum of said average luminance values of said current and prior picture blocks, 
wherein: 

if said minimum exceeds a dark scene threshold, said incremental visual sensation value is determined 
using the ratio of (a) the absolute value of said difference, and (b) said minimum; 
else, said incremental visual sensation value is determined using the ratio of (a) the absolute value of said 
difference, and (b) said dark scene threshold. 

15. The apparatus of one of claims 12 to 14. further comprising: 

means for determining said difference between average luminance values for a plurality of block pairs of said 
prior and current video pictures; and 

means for determining said incremental visual sensation value for each of said block pairs using said differ- 
ences; wherein: 

if said incremental visual sensation value exceeds the block contrast threshold level for a threshold propor- 
tion of block pairs in said current and prior video pictures, a scene change is indicated. 

16. The apparatus of claim 15. wherein said threshold proportion is approximately 80% to approximately 90%. 
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17. The apparatus of one of claims 12 to 16. further comprising: 

means for determining a relative amount of motion between said blocks of said block pair; and 
means for adjusting a size of said blocks based on said relative amount of motion. 

5 

18. The apparatus of claim 17, further comprising: 

means for increasing the size of said blocks as said relative amount of motion increases. 

w 19. The apparatus of claim 1 7, wherein said means for determining a relative amount of motion further comprises: 

means for determining a sum of the absolute value of a horizontal motion vector and the absolute value of a 
vertical motion vector; 

wherein said horizontal and vertical motion vectors are indicative of horizontal and vertical motion, 
75 respectively, of a video image of said current picture block relative to a video image of said prior picture block; 

and 

means for determining if said sum exceeds a motion threshold. 
20. The apparatus of claim 19, further comprising: 

20 

means for adjusting said motion threshold according to a picture type of said current picture. 
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(54) Scene change detector for digital video 

(57) In a method for detecting a scene change 
between a prior video picture and a current video pic- 
ture of a sequence of pictures, an average luminance 
value is determined for a block pair of the prior and cur- 
rent video pictures. Preferably, the blocks of the block 
pair are located, respectively, in the same relative posi- 
tion in the prior and current pictures. An incremental vis- 
ual sensation value is determined using a difference 
between the average luminance values, if the incremen- 
tal visual sensation value exceeds a block contrast 
threshold level, a scene change is indicated. In particu- 
lar, If the minimum of the average luminance values of 
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the current and prior picture blocks exceeds a dark 
scene threshold, the incremental visual sensation value 
is determined using the ratio of (a) the absolute value of 
the difference between the average luminance values, 
and (b) the minimum of the average luminance values of 
the current and prior picture blocks. Otherwise, the 
incremental visual sensation value is determined using 
the ratio of (a) the absolute value of the difference, and 
(b) the dark scene threshold. The method may be opti- 
mized by adjusting the block size based on the relative 
amount of motion and the current picture type. 
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